Longest Common Circular Chains of Maximal Unique Matches between Bacterial Genomes

Authors

  • Frédéric Guyon
  • Serge Hazout
  • Alain Guénoche
Abstract

The aim of this study is to compare complete genomes and identify large conserved DNA fragments in order to analyse evolutionary processes. For this purpose, we propose an efficient method to identify conserved regions between multiple genomes. First, we transform each complete genome into a permutation of Maximal Unique Matches (MUMs). Second, we compute the longest common sequence of MUMs that appear in the same order in all the genomes. This allows us to define conserved genome segments as long DNA fragments having MUMs in the same order, and to analyse evolutionary events such as reversals or transpositions of these fragments. Finally, we propose some genomic distances based on MUMs and on the length and number of conserved genome segments.
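The second step of the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes each genome has already been reduced to a linear permutation of the same MUM labels, and it ignores circularity and strand orientation, both of which the paper's title indicates the real method handles. The function name `longest_common_mum_chain` and the integer labels are hypothetical. The sketch uses a simple quadratic dynamic program over the first genome: a MUM can extend a chain only if it follows the previous MUM in every genome.

```python
def longest_common_mum_chain(genomes):
    """Return one longest list of MUM labels that occur in the same
    relative order in every genome.

    Assumption (not from the paper): `genomes` is a list of linear
    permutations of the same set of MUM labels.
    """
    if not genomes or not genomes[0]:
        return []

    # positions[g][m] = index of MUM m in genome g
    positions = [{m: i for i, m in enumerate(g)} for g in genomes]
    reference = genomes[0]
    n = len(reference)

    best = [1] * n    # best[i]: length of the longest chain ending at reference[i]
    prev = [-1] * n   # prev[i]: index of the previous MUM in that chain

    for i in range(n):
        for j in range(i):
            a, b = reference[j], reference[i]
            # a may precede b in a chain only if a comes before b in all genomes
            if best[j] + 1 > best[i] and all(p[a] < p[b] for p in positions):
                best[i] = best[j] + 1
                prev[i] = j

    # Walk back from the end of a longest chain
    end = max(range(n), key=best.__getitem__)
    chain = []
    while end != -1:
        chain.append(reference[end])
        end = prev[end]
    return chain[::-1]


if __name__ == "__main__":
    genomes = [
        [1, 2, 3, 4, 5],
        [2, 1, 3, 5, 4],
        [1, 3, 2, 4, 5],
    ]
    print(longest_common_mum_chain(genomes))
```

For two genomes this reduces to a longest increasing subsequence problem; the pairwise check over all genomes is what generalises it to the multiple-genome case, at a cost of O(k n^2) for k genomes and n MUMs.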


Similar resources

Comparing Bacterial Genomes by Searching Their Common Intervals

Comparing bacterial genomes requires a dedicated measure, one that compares circular genomes on the basis of a set of conserved genes. Under this assumption, the common interval appears to be a good candidate. To support this, we propose an approach to compute the common intervals between two circular genomes that takes duplications into account. Its application on a concrete c...


Evaluation of First and Second Markov Chains Sensitivity and Specificity as Statistical Approach for Prediction of Sequences of Genes in Virus Double Strand DNA Genomes

The growing amount of information on biological sequences has made the application of statistical approaches necessary for modeling and estimating their functions. In this paper, the sensitivity and specificity of first- and second-order Markov chains for gene prediction were evaluated using complete double-stranded DNA virus genomes. There were two approaches for prediction of each Markov Model parameter,...


Alignment-free detection of local similarity among viral and bacterial genomes

MOTIVATION: Bacterial and viral genomes are often affected by horizontal gene transfer observable as abrupt switching in local homology. In addition to the resulting mosaic genome structure, they frequently contain regions not found in close relatives, which may play a role in virulence mechanisms. Due to this connection to medical microbiology, there are numerous methods available to detect hor...


A genomic distance based on MUM indicates discontinuity between most bacterial species and genera.

The fundamental unit of biological diversity is the species. However, a remarkable extent of intraspecies diversity in bacteria was discovered by genome sequencing, and it reveals the need to develop clear criteria to group strains within a species. Two main types of analyses used to quantify intraspecies variation at the genome level are the average nucleotide identity (ANI), which detects the...


andi: Fast and accurate estimation of evolutionary distances between closely related genomes

MOTIVATION: A standard approach to classifying sets of genomes is to calculate their pairwise distances. This is difficult for large samples. We have therefore developed an algorithm for rapidly computing the evolutionary distances between closely related genomes. RESULTS: Our distance measure is based on ungapped local alignments that we anchor through pairs of maximal unique matches of a mini...




Publication date: 2003